Data searching method and information data scrapping method using internet

ABSTRACT

A data search method comprises a search condition input step inputting search condition through a user terminal connected to an electric communication network; and a batch processing search step for performing search in a batch processing, wherein the batch processing step includes: a transmission subroutine for transmitting the search condition to one or more database servers having search engines through the electric communication network, a first reception subroutine for receiving one or more search condition through the electric communication network, and a second reception subroutine for receiving data associated with the search results through the electric communication network.

TECHNICAL FIELD

The present invention relates to a data search method and, moreparticularly, to a data search method for searching data throughinformation communication, in particular, the Internet.

BACKGROUND ART

With the development of the computer technology, the electriccommunication network represented by the Internet has made an influenceon the entire society. Most of things occurred off-line have transferredto the Internet, i.e., online world such that the Internet has becomeanother life.

For instance, generally, information must be collected from literatures,newspapers, magazines, etc. at a library.

However, it became possible to easily collect information by onlyinputting keywords associated with the information to find out throughthe computer or the terminal connected to the Internet nowadays.

The general online data search and collection will be described indetail hereinafter with reference to FIG. 1.

Firstly, a user accesses a web site (for example, a newspaper site, amagazine site, or a database site having a search engine) through auser's terminal at step S1. Here, the access means to establishconnection to the web site through which to perform search. Once theconnection to the desired site is established, the user inputs keywordsassociated with the contents to find out at step S2. That is, the userinputs the keywords in a key word input box. If the search is completedat step S2, a list showing the search results is displayed on a screenof the user terminal.

At step S4, the user checks the contents of the data linked to the listby clicking an item of the list displayed on the screen of the userterminal. In such a situation, the user can refer to the respective databy randomly clicking any one of the items on the list or clicking arelevant item. The user determines whether or not the item contains thecontents he wants to find out by reading the contents of the data linkedto the clicked item at step S5. If the item contains the information hewants to find out, the user copies the content using an input devicesuch as a keyboard or a mouse at step S6. The copied contents are pastedusing a word processor such as Hangul or MS word in the form of text soas to be edited by the user at step S7.

These procedures, i.e. step S4 to step S7, are repeatedly performed inorder. By doing this, the user can collect the information he wants, andedit the collected information as he wants. At step S8, then it isdetermined, by the user's intention, whether or not there are contentsto be checked. And then, it is determined whether or not to do the sameoperation at other search site at step S9. Consequently, the informationcollection operation is terminated if it is not required to search theinformation at other sites.

In this manner, the data taken through the above procedure are stored asimage or text files and managed, if it is required, using the wordprocessor with which the user is familiar.

However, there are some problems in this data collect operation. Amongthem, a critical problem is that it takes so long time for the datacollect operation. In fact, the time being elapsed for the onlinesearch, in consideration of presently wide spread ADSL environment orsuperior, is long, i.e. about 5˜10 seconds for access to the searchsite, about 5˜10 seconds for keyword input, about 2˜20 seconds forwaiting the results (including loading additional information such asvarious advertisements, associated link, or selection window), about 3˜5seconds for selecting and clicking a specific item, about 10˜20 secondsfor checking whether or not the contents of the selected item is useful,about 10 seconds for selecting and copying the contents if it is useful,and about 5 seconds for pasting the copyed contents as a word processordocument.

As described above, it takes so long time for the user to collect theinformation through the user terminal according to the conventionalprocedures. One reason of the time consuming is that the human, thenetwork, and the user terminal are functionally mixed such that it takeslong time for changing the main body of the operation. That is, theoperation is performed in an order of user's manipulation→waiting foraccess to the target site through the network→user'smanipulation→operation of the terminal→user's decision→user'smanipulation, etc.

Also, the second reason of the time consuming is that it takes long timeto completely load a web page containing about 40˜50 uselessadvertisements, links, or images as well as the useful data foridentifying the contents. Furthermore, this procedure should berepeatedly performed whenever the user tries to search the data at othersites.

Also, the conventional repeated information collecting procedure hasshortcomings in that it makes the user feel tedious as well as wastemuch time.

Also, some useful information can be missed or duplicated during therepeated procedures. In this case, unnecessary operation for searchingthe omitted information may be performed again. Also, these recursiveoperations make the user uncomfortable if it is repeated frequently ordaily.

Recently, metaengine softwares have been developed such that the aboveproblems are solved to some extent. However, these softwares mearly showthe functional level gathering the search results in one place. That is,the softwares provide the services to display only the Uniform ResourceLocators (URL: which is a form uniformly representing the resourceaddresses for accessing over the Internet) associated with the searchresults.

The Korean Laid-Open Patent 10-2001-10807 No. discloses a newsinformation scrap method and system using the Internet, in which theinteresting information such as articles of news papers, publicannouncements, advertisements, etc. with the sources are retrieved informs of image and text files through the Internet and the searchresults are stored in a database storage space for the user.

In this technique, however, it is required for the user to access andretrieve the search results from the storage space of the database inwhich the search results are stored when the user intends to see thescrapped information. This requires a unique server for the user.

Also, either of the Korean Laid-Open Patent Nos. 10-2001-102786 and10-2002-26082 discloses service for classifying, editing, and retrievinginformation in storage space such as scrap server, database, or thelike, in that the information collected and edited in the server ordatabase can be retrieved through the Internet. However, this techniquehas a shortcoming in that the collected information cannot be read in anoff-line state.

DISCLOSURE OF INVENTION

To solve the above problems, it is an object of the present invention toprovide a data search method capable of dramatically reducing the timerequired for collecting information.

It is another object of the present invention to provide a data searchmethod capable of efficiently collecting, analyzing, and managing thedata searched through an electric communication network, i.e., theInternet.

To achieve the above objects, the data search method according to thepresent invention comprises a search condition input step inputtingsearch condition through a user terminal connected with an electriccommunication network; and a batch processing search step for performingsearch in a batch processing, wherein the batch processing stepincludes: a transmission subroutine for transmitting the searchcondition to one or more database servers having search engines throughthe electric communication network, a first reception subroutine forreceiving one or more search results searched by the search engines ofthe database servers according to the search condition through theelectric communication network, and a second reception subroutine forreceiving data associated with the search results through the electriccommunication network.

Also, the present invention provides a computer program capable ofexecuting the above data search method.

Also, the present invention provides a storage medium for storing theabove computer program.

Also, the present invention provides a method for transmitting orreceiving the above computer program through an electric communicationnetwork.

Also, the present invention provides a method for scrapping informationdata using the Internet which comprises the steps of searching targetinformation by inputting keywords using a search function of a searchsite through a user computer with online connection; accessing a webserver of the search site through an HTTP protocol automatically set atthe user computer; transmitting a query for searching at the web serverof the connected search site; transmitting one or more search resultsretrieved at one or more database servers as results of the query whichis received by the web server; downloading the searched data through theHTTP protocol; removing unnecessary data among the downloaded data;storing the data remained after the unnecessary data are removed;editing, processing, and managing the data stored in a local storagemedium using a program included in the user computer.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a flowchart illustrating a conventional data search methodthrough the Internet.

FIG. 2 is a block diagram illustrating a data search system according tothe present invention.

FIG. 3 is a flowchart illustrating a data search method according to thefirst embodiment of the present invention.

FIG. 4 a is a flowchart illustrating a server adding process of thesearch condition input step of the data search method in FIG. 3.

FIG. 4 b is a flowchart illustrating a batch processing search of thedata search method in FIG. 3.

FIG. 5 is a flowchart illustrating a data scrap method according to thesecond embodiment of the present invention.

FIG. 6 is a flowchart illustrating a stored data management process ofthe data scrap method in FIG. 5.

FIG. 7 is a conceptual view illustrating a window for displaying aprogram for executing the data search method and data scrap methodaccording to the present invention.

BEST MODE FOR CARRYING OUT THE INVENTION

The data search method and the data scrap method using the Internetaccording to the present invention will be described hereinafter withreference to the accompanying drawings.

To achieve the objects of the present invention, firstly, a function ofa batch processing for search is required in that the search isperformed at several search sites and the search results are shown atone sight. Secondly, a function for processing the search results suchthat the unnecessary data such as various banners and advertisementsthat delay loading of contents and cause problems for storing andmanaging the useful contents. Thirdly, it is required to quicklyidentify the contents even when the many results are searched so as toenhance the speed of data retrieval. That is, in case that thousands ofsearch results should be inspected, it takes a few seconds forinspecting each of search results in conventional data search technique,thus increasing time consumption. It is required to quickly inspect thecontents of the search results. Fourthly, it is required to facilitatethe data management such that the identified contents be easily managed.That is, the contents should be stored if those are useful, and on theother hand, the useless contents can be easily removed. Also, the storedcontents should be easily converted into a word processor documentformat. Fifthly, an automatic update function is required in that thesearched contents are periodically and automatically updated by user'sintension. Since recently the information rapidly changes, the storedinformation contents should be periodically updated so as to maintainthe value of the information. This increases the temporal, physical, andmental satisfactions of the user.

FIG. 2 is a block diagram illustrating a system for the data searchmethod and data scrap method according to the present invention, inwhich a data processing engine software installed in a local userterminal (personal computer), etc. connected to the Internet accesses aweb server through the Internet so as to collect the search results andstore the search results in a local storage medium (floppy disc, harddisc, compact disc, flash memory, etc.).

The user terminal 10 is a portable terminal such as a desktop computer,a portable computer, a personal digital assistants (PDA), a mobilehandset, etc. that can perform online communication through an electriccommunication network, such as the Internet. At the user terminal 10, adata processing engine software 12 should be installed. The dataprocessing engine software 12 may be a freeware, a shareware, or a paysoftware as an engine having functions searching data through theInternet and storing the data. Also, the data processing engine softwarehas a function converting the files downloaded and stored in a localstorage medium into one or more files and storing the converted files.The data processing engine software 12 is a computer program forexecuting the data search method and the data scrap method according tothe present invention.

An output device 20 is a device such as a monitor for displayingsearched data or input/output status of the input/output devices. Aninput device 30 is a device such as a keyboard and a mouse for inputtingsearch keywords and editing the searched results.

A storage device 40 is a floppy disc (FD), a hard disc drive (HDD), acompact disc (CD), or a flash memory for storing the data processingengine software 12 and the searched data, etc.

A web server or a database server 60 is a server for a web site, such asnewspaper or magazine site for providing various informations, which isconnected to the local user terminal 10 through the electriccommunication network, i.e., the Internet 50. The database server 60 maybe associated with a plurality of sub-database servers providing variousdata such as images and other informations. The database server 60 maypreferably include a search engine for searching data. The data storedin the database server 60 may be intellectual property informationrelated to patents (utility models), designs, trademarks, copyrights,etc., an internet shopping malls (price information, productsinformation), as well as newspapers and magazines.

The data search method according to the first embodiment of the presentinvention, as depicted in FIG. 3, comprises a search condition inputstep S100 inputting search condition through a user terminal 10connected to an electric communication network 50; and a batchprocessing search step performing search in a batch processing, whereinthe batch processing step includes: a transmission subroutine S210 fortransmitting the search condition to one or more database servers 60having search engines through the electric communication network 50, afirst reception subroutine S220 for receiving one or more search resultssearched by the search engines of the database servers according to thesearch condition through the electric communication network 50, and asecond reception subroutine S230 for receiving data associated with thesearch results through the electric communication network.

The search condition input step S100 may further include a serverselection step S110 for selecting the database server.

Also, in the server selection step S110, as depicted in FIG. 4 a, adomain address of the database server 60 or selecting one or moredatabase servers 60 from a server list, may be directly inputted.

Also, the server selection step S110 may further include a server addingstep S111 for adding the database servers 60 to the server list. Thedatabase server list may be stored as an additional file, communicatedbetween the users, and periodically updated.

The database server 60 may be selected using the server selection box orthe server selection popup menu.

The search condition may be inputted identical with the search engineinput condition of the database server 60 so that the user may easilyinput the search condition for search. Particularly, in case of databaseserver requiring a specific form, the search condition may be inputtedin the form identical with the form required by the search window of thedatabase server 60.

The search condition may be a keyword such as in the form of a word or asentence and may include temporal attributes so as to perform a specificsearch.

Also, the search condition may include a transmission search condition,which is transmitted to the search engine of the database server 60; anda required-data condition given to the data received at the secondreception subroutine S230.

The transmission search condition is the search condition used in thedatabase server 60, and the required-data condition is the searchcondition for selecting and processing the data searched by the databaseserver 60. Also, the required-data condition may be keywords capable ofclassifying the searched data, i.e. searching again in the searchresults S260.

The required-data condition may be a file type, a creation date, a textdocument without image, or the like that the user may optionally set.

The input type or form may differ from each other according to thedatabase servers. The transmission subroutine S210 may further include aconversion subroutine for converting the inputted search condition intoa form required by the search engine of the database server 60 such thatthe inputted search condition is converted into one which each databaseserver 60 requires for user's convenience. Of course, the conversionsubroutine may be preferably updated according to the status change ofthe corresponding database server 60.

The batch processing search step S200, as shown in FIG. 4 b, may furtherinclude a comparison/decision subroutine (S240) for determining whetheror not the data received at the second reception subroutine (S230)satisfies the search condition inputted at the search condition inputstep.

The batch processing search step S200 may further include a data storagesubroutine S250 for storing the data received at the second receptionsubroutine S230 in the user terminal.

In the data storage subroutine S250, the data received at the secondreception subroutine S230 is stored after being processed or theadvertisement parts of the data being removed. Also, in the data storagesubroutine S250, the data received at the second reception subroutineS230 may be stored after being edited in view of online attributes so asto be off-line used.

In the data storage subroutine S250, it is preferred that the receiveddata is stored in the user terminal 10 when the data differ from thepreviously stored data after being compared with each other anddetermined as such so as to prevent the duplicate data from beingstored.

Also, in the data storage subroutine S250, the data received at thesecond reception subroutine S230 may be stored after a predeterminedvalue, information on the database server which transmits the data, anda copyright of the data being added thereto.

On the other hand, the data search method according to the presentinvention may further comprise a processing step S300 for processing thedata stored in the user terminal 10 after the batch processing searchstep S200.

In the processing step S300, the received data are processed as beingconverted into an identical form, combined as one file, or editedaccording to the user-required condition.

The batch processing step S200 is repeatedly performed at preset timeintervals or in real time for reflecting changes in the data such as thedata being searched again or changed.

The search condition of the data search method according to the presentinvention may be set to include log-in information so as to access thedatabase server requiring log-in process when the database server 60requires the log-in process.

The database server 60 may include an intellectual property database, aninternet shopping mall database, an article database for newspapers andmagazines.

The database search method according to the present invention mayfurther include a web page displaying step for displaying a web pagecorresponding to a selected address. Also, the web page displaying stepmay further include a favorite registration step for storing the addressof a user's favorite web page or an address input step for inputting theaddress of the web page.

Particularly, with the web page displaying step, the user may search theweb page which the user wants to access together with a data search andcollection so as to increase the user's operation efficiency. Also, itis possible to directly access the database server 60 with the addressof the database server.

The database search method according to the present invention may beexecuted as a computer program capable of being executed in a computer,a portable terminal, etc. The computer program may be stored in variousstorage media such as a hard disc drive (HDD), a floppy disc (FD), aflash RAM, a CD, a DVD, etc. and may be transmitted to and received fromthe user's terminals or servers through the electric communicationnetwork.

On the other hand, the basic background technology of the secondembodiment of the present invention is a screen scrapping. Here, thescreen scrapping is a technique which reads the contents of the Internetweb site and extracts intended information from the contents.

For instance, with the screen scrapping, it is possible to read weatherinformation from a weather information provider site, articles from anews provider site, and securities information from a securitiesinformation provider site so as to use the information.

A data search and connection procedure executed based on the screenscrapping function according to the second embodiment of the presentinvention will be described with reference to FIG. 5.

At step S400, a search is performed by inputting keywords for variousintended informations using the search function of the search site (forexample, various information provider sites such as a newspaper site, adaily or a monthly magazine site) accessed by the user terminal 10connected online. For example, using the search function of thenewspaper site providing the news information through the onlineconnection, the intended contents are searched. At this time, it ispossible to provide an integral search function that can performsearching several sites at once using identical keywords.

After the step S400, the batch processing search step S500 installed inthe user terminal performs the following steps in a lump.

At step S511, the user terminal 10, as it is configured with a program,is automatically connected to the database server 60 of the search sitethrough the Internet with HTTP protocol.

The Hypertext Transfer Protocol (HTTP) is an application protocolassociated with a Transmission Control/Internet Protocol (TCP/IP)required for communicating files (text, graphic image, sound, video, andother multimedia files) over the web.

The user terminal transmits a search query to the database server of thesearch site at step S512 and the database server 60, in response to thesearch query, transmits the search results retrieved from one or moredatabase servers associated therewith to the user terminal 10.

The user terminal reads the actual contents using the received searchresults. That is, because most of the search results are Hyperlinksconnected to the actual contents. Accordingly, the method of presentinvention performs reading the actual contents using the searched linkinformation. During the reading operation, the screen scrappingtechnique is used. That is, the user terminal analyzes the linksconnected to the actual contents using the screen scrapping technology.At step S514, the searched data is downloaded by using the HTTPprotocol.

At step S515, from the downloaded information, unnecessary informationis removed. During this process, the read information is converted intoan appropriate form. The conversion to the appropriate form is performedthrough following processes.

By removing the unnecessary information, various advertisementinformation and unwanted links are removed, and the images associatedwith the information the online links thereof are converted intooff-line links. At this time, the link conversion is carried out asfollows.

A name of the actual image is extracted. For example, in case of a linkhttp://www.test.com/test.jpg, the file name “test.jpg” is extracted. Andthen a relative location of the image is added as a prefix of the nameof the image. At this time, the relative location may be a folder named“img”. That is, the file test.jpg has an off-line link img/test.jpg.And, the image file at the fixed link is downloaded into the “img”folder. In this manner, the local data including the image can becreated.

Also, the various HTML links are added as necessary information. Duringthe unnecessary information removal process, it is possible to removethe prefix and suffix of the link so as to remain the middle part of thelink. In some cases, the necessary tags, for example, the <html> tagrepresenting HTML document may be removed. So this important taginformation is added.

At step S516, the data from which the unnecessary information isremoved, is stored in a local storage device 40. That is, the processedinformation is stored in the local storage device 40 and the actualcontents are stored as in the form of individual files. And the linkinformation is stored in the database. By separating the contents fromlinks, the search speed is enhanced. Also, it is possible to minimizethe damage when a problem occurs in the database. Also, the individualfiles may be used independently.

At step S517, the information stored in the local storage device 40 isedited, processed, and managed by a program installed in the userterminal 10.

FIG. 6 is a flowchart illustrating a process managing the informationstored in the local storage device 40, at step S517. That is, theinformation stored in the local storage device 40 is read at step S520.Then, the contents of the read information are checked at step S521 anddetermined whether or not it is intended one at step S522. If thecontents are unnecessary, they are removed by using a removal key of theinput device 30 as at step S523 and S524. On the other hand, if thecontents are the intended one, it is determined whether or not there isunchecked information at step S525. The contents checking procedure ofsteps S522 to S525 is repeatedly performed.

On the other hand, it is determined whether or not to search otherregistered search sites at step S418 and the steps S411 to S417 arerepeatedly performed.

The processing order of the step S417 and S418 may be changed accordingto the user's intention. After the data stored in the storage medium isprocessed, it is possible to search other registered search sites andthen process the data stored in the storage medium.

The information stored during the above processes may be easily managedby the user with the removing and combining functions and the storedinformation may be easily stored and retrieved into and from otherstorage media with a backup function. Also, the information associatedwith a designated keyword may be automatically updated at predeterminedintervals, for user's convenience.

FIG. 7 shows a main screen of a program according to the presentinvention, in which the keywords selected by the user are listed on theleft side, search results corresponding to a specific keyword such as atitle, a newspaper company, a weather, etc. are displayed on the topright side, and detail information such as titles and related contentsof the article is displayed on the bottom side.

And on the bottom of the main screen, a window displaying a programexecution status is displayed. The program execution status includes awhole search status, a present site search status, a present sitestorage status, a present site, a number of data searched, etc.

And, it is possible to register a search keyword together with a searchtarget, search period, etc. The registered keyword may be removed andrecovered according to the user's intention.

The information search program according to an embodiment of the presentinvention can be utilized for a newspaper, for example Chosunilbo website, and shows the result as follows.

The search program showed the efficiency improvement, in the time takento search, of more than 500% search efficiency compared with that of theconventional search method in that the search operation is carried outby accessing the website, retrieving, and checking the contents.Particularly, the search method of the present invention has showed thebetter efficiency when the number of search results increases.

The search method is tested in an environment in that the user computerhas been running with the operating system of Windows 2000® andconnected to the Internet through a high-speed digital subscriber line(xDSL).

When the search is performed with a keyword “changup” in KoreanLanguage, about 6000 search results are retrieved. If these searchresults are checked with the conventional search method, the time takento check will be 5 seconds per each and the total 5 seconds×6000=8.3hours.

And the time taken to copy and store the intended data becomes 3˜4 timeslonger. Accordingly, at least more than 20 hours will be taken.

However, in case that the data processing engine software of the presentinvention, the time taken to process the 6000 search results is about20˜30 minutes (the time may change according to the status of high speedInternet) and the checking time become 1.5 seconds per each and 2 hoursand 30 minutes in total. Furthermore, since the checking, removing,storing processes are performed at the same time; there is no additionaltime for copying and storing the data. Accordingly, the total timerequired for the whole search process will become about 3 hours.

Objectively, the data search method of the present invention showssuperior temporal efficiency of 3 hours to the 20 hours of theconventional search method, i.e. improvement over 600% of temporalefficiency.

Also, in the present invention, other operations can be performed duringthe search operation such that the actual time taken for search can bemuch shorter than ever.

INDUSTRIAL APPLICABILITY

As described above, the information scrapping method using the Internetaccording to the present invention is practical in various fields andobjects and can be efficiently utilized for researching and storing dataregarding to the own brand products, competitor products, and markettrends at the planning and sales promotion departments of businesses.Also, the information scrapping method can be practically used by asales department for researching and storing the information on theclient companies, the business trends, and personnel, and also can beused for researching the business related information by an individualwho are planning to start business. Also, the method can be used by astock investor for gathering information on the stocks, he owns, such asbusiness news and trend of the company related to the stocks and thegeneral trend of the industry.

Also, in case of a student, the information scrapping method can beutilized for collecting various reports and articles or photographs ofentertainers he/she likes and for collecting the data related to hishobbies and health.

Furthermore, according to the present invention the web documentssearched by the data processing engine software can be compressed in aminimal form and then stored in the local storage medium such that it ispossible to retrieve the stored data regardless of the online connectionand minimize the time required for searching and checking the data.Also, since the data are stored after being minimized in size it is easyto manage the data by deleting and combining the same.

1. A data search method comprising: a search condition input stepinputting search condition through a user terminal connected to anelectric communication network; and a batch processing search stepperforming search in a batch processing, wherein the batch processingstep includes: a transmission subroutine for transmitting the searchcondition to one or more database servers having search engines throughthe electric communication network, a first reception subroutine forreceiving one or more search results searched by the search engines ofthe database servers according to the search condition through theelectric communication network, and a second reception subroutine forreceiving data associated with the search results through the electriccommunication network.
 2. The method of claim 1, wherein the searchcondition input step further includes a server selection step forselecting the database server.
 3. The method of claim 2, wherein, in theserver selection step, a domain address of the database server isdirectly inputted.
 4. The method of claim 3, wherein, in the serverselection step, one or more database servers from a server list areselected.
 5. The method of claim 3, wherein the server selection stepfurther includes the step for adding the database servers to the serverlist.
 6. The method of claim 1, wherein, in the search condition inputstep, the search condition is inputted corresponding to the inputcondition required for the search engine of the database server.
 7. Themethod of claim 1, wherein the search condition is keywords.
 8. Themethod of claim 1, wherein the search condition includes timeattributes.
 9. The method of claim 1 or 6, wherein the search conditionincludes: a transmission search condition that is transmitted to thesearch engine of the database server; and a required-data conditiongiven to the data received at the second reception subroutine.
 10. Themethod of claim 9, wherein the required-data condition includes filetype and a creation date of the data.
 11. The method of claim 1, whereinthe transmission subroutine further includes a conversion subroutine forconverting the inputted search condition so as to have a type requiredfor the search engine of the database server.
 12. The method of claim 1,wherein the batch processing search step further includes acomparison/decision subroutine for determining whether or not the datareceived at the second reception subroutine satisfy the search conditioninputted at the search condition input step.
 13. The method of claim 1,wherein the batch processing search step further includes a data storagesubroutine for storing the data received at the second receptionsubroutine in the user terminal.
 14. The method of claim 13, wherein, inthe data storage subroutine, the data received at the second receptionsubroutine, is stored after being processed.
 15. The method of claim 13,wherein, in the data storage subroutine, the data received at the secondreception subroutine, is stored after being removed an advertisementpart from the received data.
 16. The method of claim 13, wherein, in thedata storage subroutine, the data received at the second receptionsubroutine, is stored after being editing online elements from thereceived data so as to be used in off-line.
 17. The method of claim 13,wherein, in the data storage subroutine, the received data, is comparedwith the previously stored data and is stored when the received datadiffers from the previously store data.
 18. The method of claim 13,wherein, in the data storage subroutine, the data received at the secondreception subroutine is stored after being added a presset value. 19.The method of claim 18, wherein, in the data storage subroutine, thedata received at the second reception subroutine, is stored after beingadded database server information associated with the database servertransmitted the data and copyright information of the data.
 20. Themethod of claim 1, further comprising a processing step for processingthe data stored in the user terminal after the batch processing searchstep.
 21. The method of claim 20, wherein the data is converted to anidentical form at the processing step.
 22. The method of claim 20,wherein the received data is combined as one file in the processingstep.
 23. The method of claim 1, wherein the batch processing step isperiodically repeated at preset time intervals.
 24. The method of claim1, wherein the batch processing step is repeated in real time.
 25. Themethod of claim 1, wherein the search condition includes log-ininformation for accessing the database server requiring a log-inprocess.
 26. The method of claim 1, wherein the database server is anintellectual property database server.
 27. The method of claim 1,wherein the database server is an Internet shopping mall databaseserver.
 28. The method of claim 1, wherein the database server is anarticle database server.
 29. The method of claim 1, further comprising aweb page display step for displaying a web page corresponding to theselected domain address.
 30. A computer program being executable inaccordance with the methods of claim
 1. 31. A storage medium for storingthe computer program of claim
 30. 32. A method for transmitting andreceiving the computer program of claim 30 through an electriccommunication network.
 33. A method for scrapping using the Internetcomprising: searching target information by inputting keywords using asearch function of a search site through a user computer with onlineconnection; accessing a web server of the search site through an HTTPprotocol automatically set at the user computer; transmitting a queryfor searching at the web server of the connected search site;transmitting one or more search results retrieved at one or moredatabase servers as results of the query which is received by the webserver; downloading the searched data through the HTTP protocol;removing unnecessary data among the downloaded data; storing the dataremained after the unnecessary data are removed; editing, processing,and managing the data stored in a local storage medium using a programincluded in the user computer.
 34. The method of claim 33, wherein theprogram (data processing engine software) of the user computerautomatically and periodically updates the data associated with a searchword designated by the user.
 35. The method of claim 33, wherein theunnecessary data is various advertisements data and unnecessary links.36. The method of claim 33, wherein image data link conversions areperformed in such a way that in case of images associated with thecontents the online links are converted into off-line links.
 37. Themethod of claim 33, wherein the searched data is any one of onlinenewspaper, magazine, and web document.
 38. The method of claim 33,further comprising the step of minimizing storing time and space byremoving the unnecessary tag parts and storing necessary parts from thedownloaded data.
 39. The method of claim 33, wherein the program (dataprocessing engine software) included in the user computer automaticallyconverts the contents of the downloaded and stored HTML document forusing the additional data such as images at the local storage medium.40. The method of claim 33, wherein the program (data processing enginesoftware) included in the user computer converts the files downloadedand stored in the local storage medium into one or more files and thenstores the same.
 41. The method of claim 33, wherein the local storagemedium is any one of a floppy disc, a hard disc, a compact disc, and aflash memory.